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Delivery of nucleic acids into eukaryotic genomes using in 
vitro assembled mu transposition complexes 

The present invention relates to genetic ©agineering and especially to the use of DNA 
transposition complex of bacteriophage Mu. In particular, the invention provides a gene 
transfer system for eukaryotic cells, wherein m vitro assembled Mu transposition 
complexes are introduced into a target cell. Inside the cell, the complexes readily mediate 
integration of a transposon construct into a cellular nucleic acid. The invention further 
provides a kit for producing insertional mutations into the genomes of eukaryotic cells. The 
kit can be used, e.g., to generate insertional mutant libraries. 



BACKGROUND OF THE INVENTION 

Efficient transfer of nucleic acid into a target cell is prerequisite for the success of almost 
any molecular biology application. The transfer of nucleic acid into various types of cells 
provides means to study gene ftinction in living organisms, to express exogenous genes, or 

15 to regulate cell functions such as protein expression. Stably transferred inserts can also be 
used as primer binding sites in sequencing projects. In principle, the transfer can be 
classified as transient or stable. In the former case the transferred genetic material will 
eventually disappear from the target cells. Transient gene transfer typically utilizes plasmid 
constructions that do not replicate within the host cell. Because vector molecules that 

20 would replicate in mammalian cells are scarce, and in essence they are limited to those 
involving viral replicons (i.e. no plasmids available), the transient transfer strategy is in 
many cases the only straightforward gene transfer strategy for mammalian cells. For other 
types of cells, e.g. bacterial and lower eukaryotes such as yeast, replicating plasmids are 
available and therefore transient expression needs to be used only in certain specific 

25 situations in which some benefits can be envisioned (e.g. conditional expression). 



In many cases stable gene transfer is the preferred option. For bacteria and lower 
eukaiyotes plasmids that replicate within the cells are available. Accordingly, these DNA 
molecules can be used as gene delivery vehicles. However, the copy numbers of such 
30 plasmids typically exceeds one or two and therefore the transferred genes increase the gene 
dosage substantially. Typically used plasmids for bacteria and yeasts are present in tens or 
himdreds of copies. Increased gene dosage compared to normal situation is a potential 
source of artefactual or at least biased experimental results in many systems. Therefore, it 
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would be advantageous to generate situations in which single-copy gene transfer (per 
haploid genome) would be possible. 

In general, stable single-copy gene transfer can be achieved if transferred DNA can be 
5 inserted into the target cell's chromosomal DNA. Traditionally, this has been achieved by 
using dififerent types of recombination reactions. In bacteria, homologous recombination 
and site-specific recombination are both widely used and in some cases yet less well 
characterized "illegitimate" recombination may be used. The choice of a method typically 
depends on whether a random or targeted mutation is required. While some of these 
10 methods are relatively trivial to use for a subset of the bacterial species, a general-purpose 
method would be more desirable. 

Recombination reactions may also be used to stably transfer DNA hato eukaiyotic cell's 
chromosomal DNA. Homologotis and site-specific recombination reactions produce 

15 targeted integrations, and "illegitimate" recombination generates non-targeted events. 
Utilization of transpositional recombination has been described for baker's yeast 
Saccharomyces cerevisiae (Ji et al 1993) and for fission yeast Schizosaccharomyces 
pombe (Behrens et al 2000). These strategies involve in vivo transposition in which the 
transposon is launched firom within the cell itself They utilize suitably modified 

20 transposons in combination with transposase proteins that are produced within a given cell. 
Similar systems, in which transposase proteins are produced within cells, are available also 
for other eukaryotic organisms; typical examples include Drosophila and Zebra fish 
(Rubin and Spradling 1982, Raz et al. 1997). 

25 While transposition systems based on in vivo expression of the transposition machinery are 
relatively straightforward to use they are not an optimal choice for gene transfer for various 
reasons. For example, efficiency as well as the host-range may be limited, and target site 
selection may not be optimal. Viral systems, especially retroviral insertion methods, have 
been used to generate genomic insertions for animal cells. These strategies also have some 

30 disadvantageous properties. For example, immime response may be elicited as a response 
to virally-encoded proteins, and in general, constructing safe and efficient virus vectors and 
respective packaging cell lines for a given application is not necessarily a trivial task. 
Therefore, also for eukaryotic cells, a general-purpose random non-viral DNA insertion 
strategy would be desirable. Introduction of in vitro-assembled transposition complexes 
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into the cells may be a choice. It is likely that utilization of in viYro-assembled DNA 
transposition complexes may be one of the most versatile systems for gene transfer. 
Recently, such a system for bacterial cells has been described and it utilizes chemical 
reactions based on transpositional DNA recombination (US 6,159,736 and US 6,294,385). 
5 Efficient systems are expected to provide a pool of mutants that can be used various ways 
to study many types of aspects of cellular life. These mutant pools are essential for studies 
involving whole genomes (i.e. functional genomics studies). However, a priori it is not 
possible to envision whether in vzYro-assembled DNA transposition complexes would work 
when introduced into eukaryotic cells, especially if the components are derived from flie 

10 prokaiyota. The difference between prokaryotic and eukaryotic cells, especially the 

presence of nuclear membrane and packaging of eukaryotic genomic DNA into chromatin 
structure, may prevent the prokaryotic systems from fimctioning. In addition, in view of 
the stability and catalytic activity of the transposition complex, conditions within 
eukaryotic cells may be substantially different from prokaryotic cells. In addition, other 

1 5 unknown restriction system(s) may fight against incoming DNA and non-specific proteases 
may destroy assembled transposition complexes before they execute their function for 
integration. Furthermore, even if the traospositional reaction integrates the transposon into 
the genome, the ensuing 5-bp single-stranded regions (and in some cases 4-nt flanking 
DNA flaps) would need to be corrected by the host. Therefore, it is clear that the stability 

20 and efficiency of transposition complexes inside a eukaryotic cell cannot be predicted from 
the results with bacterial cells as disclosed in US 6,159,736 and US 6,294,385. Thus, to 
date there is no indication in the prior art that in v//ro-assembled transposition complexes 
can generally be used for nucleic acid transfer into the cells of higher organisms (i.e. 
eukaryotes). 

25 

Bacteriophage Mu replicates its genome using DNA transposition machinery and is one of 
the best characterized mobile genetic elements (Mizuuchi 1992; Chaconas et al., 1996). We 
utilised for the present invention a bacteriophage Mu-derived in vitro transposition system 
that has been introduced recently (Haapa et al. 1999a). Mu transposition complex, the 
30 machinery within which the chemical steps of transposition take place, is initially 

assembled from four MuA transposase protein molecules that first bind to specific binding 
sites in the transposon ends. The 50 bp Mu right end DNA segment contains two of these 
binding sites (they are called Rl and R2 and each of them is 22 bp long, Savilahti et al. 
1995). When two transposon ends meet, each bound by two MuA monomers, a 
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transposition complex is fonned through conformational changes. Then Mu transposition 
proceeds within the contract of said transposition complex, i.e., protein-DNA complexes 
that are also called DNA transposition complexes or transpososomes (Mizuuchi 1992, 
Savilahti et al. 1995). Functional core of these complexes are assembled from a tetramer of 
5 MuA transposase protein and Mu-transposon-derived DNA-end-segments (i.e. transposon 
end sequences recognised by MuA) containing MuA binding sites. When the core 
complexes are formed they can react in divalent metal ion-dependent manner with any 
target DNA and insert the Mu end segments into the target (Savilahti et al 1995). A 
halhnark of Mu transposition is the generation of a 5-bp target site duplication (Allet, 
10 1979; Kahmann and Kamp, 1979). 

In the simplest case, the MuA transposase protein and a short 50 bp Mu right-end (R-end) 
fragment are the only macromolecular components required for tranq)osition complex 
assembly and function (Savilahti et al. 1995, Savilahti and Mizuuchi 1996). Analogously, 
1 5 when two" R-end sequences are located as inverted terminal repeats in a longer DNA 

molecule, transposition complexes form by lapsing (he transposon ends. Taiget DNA in 
the Mu DNA in vitro transposition reaction can be linear, open circular, or siq)ercoiled 
(Haapa et al. 1999a). 

20 To date Mu in vitro transposition-based strategies have been utilized efficiently for a 

variety of molecular biology applications including DNA sequencing (Haapa et al. 1999a; 
Butterfield et al. 2002), generation of DNA constructions for gene targeting (Vilen et al., 
2001), and functional analysis of plasmid and viral (HIV) genomic DNA regions (Haapa et 
al., 1999b, Laurent et al., 2000). Also, functional genomics studies on whole virus 

25 genomes of potato virus A and bacteriophage PRDl have been conducted using the Mu in 
vitro transposition-based approaches (Kekarainen et al., 2002, Vilen et al., 2003). In 
addition, pentapeptide insertion mutagenesis method has been described (Taira et al., 
1999). Recently, an insertional mutagenesis strategy for bacterial genomes has been 
developed in which the in vitro assembled functional transpososomes were delivered into 

30 various bacterial cells by electroporation (Lamberg et al., 2002). 

E. coli is the natural host of bacteriophage Mu. It was first shown with E. coli that in vitro 
preassembled transposition complexes can be electroporated into the bacterial cells 
whereby they then integrate the transposon construct into the genome (Lamberg et al. 
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2002). The Mu transpososomes were also able to integrate transposons into the genomes of 
three other Gram negative bacteria tested, namely, Salmonella enterica (previously known 
as S. typhimurium), Erwinia carotovara, and Yersinia enterocolitica (Lamberg et al. 2002). 
In each of these four bacterial species the integrated transposons were flanked by a 5-bp 
5 target site duplication, a halhnark of Mu transposition, thus confirming that the integrations 
were generated by DNA transposition chemistry. 

SUMMARY OF THE INVENTTON 

We have developed a gene transfer system for eukaryotic cells that utilizes in vitro- 
1 0 assembled phage Mu DNA transposition complexes. Linear DNA molecules containing 
appropriate selectable markers and other genes of interest are generated that are flanked by 
DNA sequence elements needed for the binding of MuA transposase protein. Incubation of 
such DNA molecules with MuA protein results in the formation of DNA transposition 
complexes, transpososomes. These can be delivered into eukaryotic cells by 
1 5 electroporation or by other related methods. The method described in the present mvention 
expands ttie applicability of the Mu transposon as a gene delivery vehicle into eukaryotes. 

In a first aspect, the invention provides a method for incorporating nucleic acid segments 
into cellular nucleic acid of a eukaryotic target cell, the method comprising the step of: 

20 

delivering into the eukaryotic target cell a Mu transposition complex that comprises 
(i) MuA transposases and (ii) a transposon segment that comprises a pair of Mu end 
sequences recognised and bound by MuA transposase and an insert sequence between said 
Mu end sequences, under conditions that allow integration of the transposon segment into 
25 the cellular nucleic acid. 

In another aspect, the invention features a method for forming an insertion mutant library 
fix)m a pool of eukaryotic target cells, the method comprising the steps of: 

30 a) delivermg into the eukaryotic target cell a Mu transposition complex that comprises 
(i) MuA transposases and (ii) a transposon segment that comprises a pair of Mu end 
sequences recognised and bound by MviA transposase and an insert sequence with a 
selectable marker between said Mu end sequences, vmder conditions that allow integration 
of the transposon segment into the cellular nucleic acid. 
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b) screening for cells that comprise flie selectable marker. 

In a third aspect, the invention provides a kit for incorporating nucleic acid segments into 
cellular nucleic acid of a eukaryotic target cell. 

The term "transposon", as used herein, refers to a nucleic acid segment, which is 
recognised by a transposase or an integrase enzyme and which is essential component of a 
functional nucleic acid-protein complex capable of transposition (i.e. a transpososome). 
Minimal nucleic acid-protein complex capable of transposition in the Mu system 
comprises four MuA transposase protein molecules and a transposon with a pair of Mu end 
sequences that are able to interact with MuA. 

The term "transposase" used herein refers to an enzyme, which is an essential component 
of a functional nucleic acid-protem complex capable of transposition and which is 
mediating transposition. The term "transposase" also refers to integrases from 
retrotransposons or of retroviral origin. 

The expression ^transposition" used herein refers to a reaction wherein a transposon inserts 
itself into a target nucleic acid. Essential components in a transposition reaction are a 
transposon and a transposase or an integrase enzyme or some other components needed to 
form a functional transposition complex. The gene delivery method and materials of the 
present invention are established by employing the principles of in vitro Mu transposition 
(Haapa et al. 1999ab and Savilahti et al, 1995). 

The term "transposon end sequence" used herein refers to the conserved nucleotide 
sequences at the distal ends of a transposon. The transposon end sequences are responsible 
for identifying the transposon for transposition. 

The term "transposon binding sequence" used herein refers to the conserved nucleotide 
sequences within the transposon end sequence whereto a transposase specifically binds 
when mediating transposition. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. Mini-Mu transposon integration into the yeast chromosomal or plasmid DNA in 
vivo by in vitro-assembled Mu transposition complexes comprising of a tetramer of MuA 
transposase and a mini-Mu transposon. 

Figures 2A and 2B. Schematic representation of the Mu-transposons used in this study 
with the relevant restriction sites. (2 A) Both of the yeast transposons contain 7EF promoter 
^tef), Aa« marker g«ie and TEF terminator (Ttbf) embedded between two 50 bp Mu right 
end sequences. The kanMX4-pl5A-Mu transposon contains the additional pl5A replicon. 
Short arrows denote the binding sites of the primers used for sequencing of the out-cloned 
flanking sequences. The BglH. sites in the ends are used to excise the transposon from the 
vector plasmid backbone. (2B) The Mu/LoxP-Kan/Neo transposon for transfecting the 
mouse ES cells. It contains kan/neo marker gene between two Mu right end and LoxP 
sequences. The kan/neo marker includes the prokaryotic and eukaryotic promoters and 
terminators as explained in Materials and methods. 

Figure 3. Mu transposition complex formation with KanMX4-Mu (1.5 kb) and KanMX4- 
pl5A-Mu (2.3 kb) substrates analysed by ariose gel electrophoresis. Substrate DNA was 
incubated with or without MuA, and the reaction products were analysed in the presence or 
absence of SDS. Samples were electrophoresed on 2 % agarose gel containing 87 mg/ml 
of heparin and 87 mg/ml of BSA. 

Figures 4A and 4B. Southern blot analysis of the insertions into the yeast genome. 
Genomic DNA of 17 geneticin-resistant FY1679 clones, resulting from the electroporation 
of the transposition complexes into yeast cells, was digested with BaniHl +Bgl II (4A) or 
HindSS. (4B) and probed with kanMX4 DNA. Lanes 1-17, transposon insertion mutants; C, 
genomic DNA of original S. cerevisiae FY1679 recipient strain as a negative control; P, 
linearized plasmid DNA containing kanMX4-Mu transposon as a positive control; M, 
molecular size marker. The sizes of plasmid fragments are shown on the left. 

Figures 5A and 5B. Distribution of kanMX4-Mu integration sites on yeast chromosomes 
(5A) and in the repetitive rDNA region on chromosome 12 (SB). The ovals in (5A) 
designate the centromer of each chromosome. Integration sites in the diploid strain 
FY1679 are indicated by bars, and the integration sites in the haploid strain FY-3 by bars 
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with fiUed circles. Above the line representing yeast genomic DNA are indicated the 
transposons that contained the kan gene in the orientation of Watson strand, below the line 
the transposons are in tiie Crick strand orientation. 

5 Figure 6. Southern blot analysis of HeLa clones transfected with tiie transposon 

complexes. Lanes: 1. Marker with the following bands: 10 kb, 8 kb, 6 kb, 5 kb, 4 kb, 3 kb, 
2.5 kb. 2, HeLa genomic DNA. 3. HeLa genomic DNA mixed with purified MuTLoxP- 
Kan/Neo transposon (about 2.1 kb). HeLA clones: 4. RGC13 5. RGC14 6. RGC15 7. 
RGC16 8. RGC23 9. RGC24 10. RGC26 

10 

DETAILED DESCRIPTION OF THE INVENmON 

The in vitro assembled transposition con^lex is stable but catalytically inactive in 
conditions devoid of Mg^* or other divalent cations (Savilahti et al., 1995; Savilahti and 
Mizuuchi, 1996). After electroporation into bacterial cells, these complexes remain 

1 5 functional and become activated for tramposition chemistry upon encountering Mg^* ions 
within the cells, facilitating transposon integration into host chromosomal DNA (Lamberg 
et al., 2002). The in vitro preassembled transpososomes do not need special host cofactors 
for the integration step in vivo (Lamberg et al., 2002). Importantly, once introduced into 
cells and integrated into the genome, the inserted DNA will remain stable in cells that do 

20 not express MuA (Lamberg et al. , 2002). 

To study if the Mu transposition system with the in vitro assembled transpososomes works 
also for higher organisms we constructed transposons (antibiotic resistance markers 
connected to Mu ends), assembled the conq)lexes and tested the transposition strategy and 

25 target site selection after electroporation of yeast or mouse cells. The transposons were 
integrated into the genomes with a 5-bp target site duplication flanking the insertion 
indicating that a genuine DNA transposition reaction had occurred. These results 
dranonstrate that, surprisingly, the conditions in eukaryotic cells allow the integration of 
Mu DNA. Remarkably, the nuclear membrane, DNA binding proteins, or DNA 

30 modifications or conformations did not prevent the mtegration. Furhermore, the structure 
and catalytic activity of the Mu complex retained even after repeated concentration steps. 
This expands the applicability of the Mu transposition strategy into eukaryotes. The benefit 
of this system is that there is no need to generate an expression system of the transposition 
machinery for flie organism of interest 
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The invention provides a method for incorporating nucleic acid segments into ceUular 
nucleic acid of an isolated eukaryotic target cell or a group of such ceUs (such as a tissue 
sample or culture), the method comprising the step of: 

5 

deUvering into the eukaryotic target cell an in vitro assembled Mu transposition complex 
that comprises (i) MuA transposases and (ii) a transposon segment that comprises a pair of 
Mu end sequences recognised and bound by MuA transposase and an insert sequence 
between said Mu end sequences, under conditions that allow integration of the transposon 
1 0 segment into the cellular nucleic acid. 

For the method, one can assemble in vitro stable but catalytically inactive Mu transposition 
conq)lexes in conditions devoid of Mg^* as disclosed in SavUahti et al., 1995 and SavUahti 
and Mizuuchi, 1996. In principal, any standard physiological buffer not containing Mg^* 

15 is suitable for the assembly of said inactive Mu transposition complexes. However, a 

preferred in vitro transpososome assembly reaction may contain 150 mM Tris-HCl pH 6.0, 
50 % (v/v) glycerol, 0.025 % (w/v) Triton X-100, 150 mM NaCl, 0.1 mM EDTA, 55 nM 
transposon DNA fiagment, and 245 nM MuA. The reaction volume may be for example 20 
or 80 microliters. The reaction is incubated at about 30*0 for 0.5 - 4 h, preferably 2 h. To 

20 obtain a sufficient amount of transposition complexes for delivery into the cells, the 

reaction is then concentrated and desalted from several assembly reactions. For the yeast 
transformations the final concentration of transposition complexes compared to the 
assembly reaction is preferably at least tenfold and for the mouse cell transfections at least 
20-fold. The concentration step is preferably carried out by using centrifugal filter units. 

25 Alternatively, it may be carried out by centrifugation or precipitation (e.g. using PEG or 
other types of precipitants). 

In the method, the concentrated tranposition complex fraction is delivered into the 
eukaryotic target cell. The preferred delivery method is electroporation. The 
30 electroporation of Mu transposition complexes into bacterial cells is disclosed in Lamberg 
et al., 2002. However, the method of Lamberg et al camiot be directly employed for 
introduction of the complexes into eukaryotic cells. As shown below in the Experimental 
Section, the procedure for electroporation of mouse embryonic stem (BS) cells described 
by Sands and Hasty (1997) can be used in the method of the invention. A variety of other 
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DNA introduction methods are known for eukaryotic cells and the one skilled in the art cai 
readily utilize tttese methods in order to cany out the mettiod of the invention (see e.g. 
"Electroporation Protocols for Microorganisms", ed. Jac A. Nickoloff, Methods in 
Molecular Biology, volume 47, Humana Press, Totowa, New Jersey, 1995; "Animal Cell 
5 Electroporation and Electrofosion Protocols", ed. Jac A. Nickoloff, Methods in Molecular 
Biology, volume 48, Humana Press, Totowa, New Jersey, 1995; and "Plant cell 
Electroporation and Electrofusion Protocols", ed. Jac A. Nickoloff Methods in Molecular 
Biology, volume 55, Humana Press, Totowa, New Jersey, 1995). Such DNA delivery 
methods include direct injections by the aid of needles or syringes, exploitation of 
10 liposomes, and utilization of various types of transfection-promoting additives. Physical 
methods such as particle bombardment may also be feasible. 

Transposition into the cellular nucleic acid of the target cell seems to follow directly after 
the. electroporation without additional intervention. However, to promote transposition and 
15 remedy the stress caiised by the electroporation, the cells can be incubated at about room 
temperature to 30 °C for 10 min - 48 h or longer in a suitable medium before plating or 
other subsequent steps. Preferably, a single insertion into flie cellular nucleic acid of the 
target cell is produced. 

20 The eukaryotic target cell of the method may be a human, animal (preferably a mammal), 
plant, fungi or yeast cell. Preferably, the animal cell is a cell of a vertebrate such as mouse 
(Mus muscuius), rat {Rattus norvegicus), Xenopus, Fugu or zebra fish or an invertebrate 
such as Drosophila melemogaster or Caenorhabditis elegans. The plant cell is preferably 
fi:om Arabidopsis thaliam, tobacco or rice. The yeast cell is preferably a cell of 

25 Saccharomyces cerevisiae or Schizosaccharomyces pombe. 

The insert sequence between Mu end sequences preferably comprises a selectable marker, 
gene or promoter trap or enhancer trap constructions, protein expressing or RNA producing 
sequences. Such constructs renders possible the use of the method in gene tagging, 
30 functional genomics or gene therapy. 

The term "selectable marker" above refers to a gaie that, when carried by a transposon, 
alters the ability of a cell harboring the transposon to grow or svurvive in a given growth 
environment relative to a similar cell lacking the selectable marker. The transposon nucleic 
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acid of the invention preferably contains a positive selectable marker, A positive selectable 
marker, such as an antibiotic resistance, encodes a product tiiat enables the host to grow 
and survive in the presence of an agent, which otherwise would inhibit the growth of the 
organism or kill it The insert sequence may also contain a reporter gene, which can be any 
gene encoding a product whose expression is detectable and/or quantitatable by 
immunological, chemical, biochemical, biological or mechanical assays. A reporter gene 
product may, for example, have one of the following attributes: fluorescence (e.g., green 
fluorescent protein), enzymatic activity (e.g., luciferase, /ccZ/p-galactosidase), toxicity 
(e.g., ricin) or an ability to be specificaUy bound by a second molecule (e.g., biotin). The 
use of markers and reporter genes in eukaryotic cells is well-known in the art. 

Since the target site selection of /;» viWo Mu system is known to be random or nearly 
random, one preferred embodiment of the invention is a method, wherein the nucleic acid 
segment is incorporated to a random or ahnost random position of tixe cellular nucleic acid 
of the target cell. However, targeting of tiie transposition can be advantageous in some 
cases and thus another preferred embodiment of the invention is a metiiod, wherein the 
nucleic acid segment is incorporated to a targeted position of tiie cellular nucleic acid of 
the target cell. This could be accomplished by adding to the transposition complex, or to 
the DNA region between Mu ends in tiie transposon, a targeting signal on a nucleic acid or 
protein level. Said targeting signal is preferably a nucleic acid, protein or peptide which is 
known to efficientiy bind to or associate witii a certam nucleotide sequence, thus 
facilitating targeting. 



One specific embodiment of the invention is the method wherein a modified MuA 
transposase is used. Such MuA tiransposase may be modified, e.g., by a deletion, an 
insertion or a point mutation and it may have different catalytic activities or specifities tiian 
an unmodified MuA. 



Anotiier embodiment of tiie invention is a metiiod for forming an insertion mutant library 
fixim a pool of eukaryotic target cells, the method comprising the steps of: 

a) deUvering into the eukaryotic target cell an in vitro assembled Mu ti^sposition complex 
tiiat comprises (i) MuA transposases and (u) a transposon segment tiiat comprises a pair of 
Mu end sequences recognised and bound by MuA ti:ansposase and an insert sequence witii 
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a selectable maricer between said Mu end sequences, under conditions that allow 
integration of the transposon segment into the cellular nucleic acid. 

b) screening for cells that comprise the selectable marker. 

In the above method, a person skilled in the art can easily utilise different screening 
techniques. The screening step can be performed, e.g., by methods involving sequence 
analysis, nucleic acid hybridisation, primer extension or antibody binding. These methods 
are well-known in the art (see, for example. Current Protocols in Molecular Biology, eds. 
Ausubel et al, John Wiley & Sons: 1992). Libraries formed according to the the method of 
the invention can also be screened for genotypic or phenotypic changes after transposition. 

FurfliOT embodiment of the invention is a kit for incorporating nucleic acid segments into 
cellular nucleic acid of a eukaryotic target cell. The kit comprises a concentrated fraction 
of Mu transposition complexes that comprise a transposon segment with a marker, which is 
selectable in eukaryotic cells. Preferably, said complexes are provided as a substantially 
pure preparation apart from other proteins, genetic material, and the like. 

The publications and other materials used herein to illuminate the background of the 
invention, and in particular, to provide additional details with respect to its practice, are 
incorporated herein by reference. The invention will be described in more detail in the 
following Experimental Section. 

EXPERIMENTAL SECTTQN 
MATERIALS AND METHODS 
Strains, cell lines and media 

Hie Eschericia coli DH5a was used for bacterial transformations. The bacteria were grown 
at 37 °C in LB broth or on LB agar plates. For the selection and maintenance of plasmids, 
antibiotics were used at the following concentrations: ampicillin 100- 1 50 pg/ml, 
kanamycin 10-25 ng/ml, and chloramphenicol 10 ng/ml. The Saccharomyces cerevisiae 
strain FY1679 {MATa/MATa um3-52/ura 3-52 his3A200/HIS3 leu2Al/LEU2 
trplA63/TRPl GAL2/GAL2; Winston et al. 1995) and its haploid derivative FY-3 (MATa 



wo 2004/090146 



PCT/FI2004/000228 



13 

HIS LEU TRP ura3-52) were used for yeast transfoimations. The yeasts were grovra on 
YPD (1 % yeast ejctract, 2 % peptone, 2 % glucose) or minimal medium (0.67 % yeast 
nitrogen base, 2 % glucose). For the selection of the transfoimants, yeast cells were grown 
on YPD plates containing 200 ^ig/ml of G418 (geneticin, Sigma). 

5 

The procedures required for propagating moxxse AB2.2-Prime embryonic stem (ES) cells 
(Lexicon Genetics, Inc.) have been described by Sands and Hasty (1997). Briefly, 
undifferentiated AB2.2-Prime ES cells were grown on 0.1 % gelatin (Sigma)-coated 
tissues culture plates in the ES culture medium consisting of DMEM (Gibco) 
10 supplem«ited with 1 5 % fetal bovine serum (Hyclone), 2 mM L-glutamine (Gibco), 1 mM 
Sodium pyruvate (Gibco), 100 nM P-Mercaptoethanol and nonessential amino acids 
(Gibco), 50 U/ml Penicillin, 50 ng/ml Streptomycin (Gibco), and 1000 U/ml LIF 
(Chemicon). 



1 5 HeLa S3 cells (ATCC # CCL-2,2) were grown in cell culture medium consisting of MEM 
supplemented with 10% fetal bovine serum (Gibco Invitrogen), 2 mM L-glutamine (Gibco 
Invitrogen), 50 U/ml Penicillin (Gibco Invitrogen), and 50 jig/ml Streptomycin (Gibco 
Invitrogen). 



20 Proteins and reagents 

MuA transposase (MuA), proteinase K, calf intestinal alkaline phosphatase (CIP) and 
Cam*^ Entranceposon (TGS Template Generation System) were obtained from Finnzymes, 
Espoo, Finland. Restriction endonucleases and the plasmid pUC19 were from New 
England Biolabs. Klenow enzyme was from Promega. Enzymes were used as 

25 recommended by the suppliers. Bovine serum albumin was from Sigma. [a^¥]dCTP 
(1000-3000 Ci/mmol) was from Amersham Biosciences. 

Construction of kanMX4-Mu transposons 

The kanMX4 selector module (1.4 kb) was released from the pFA6-kanMX4 (Wach et al. 
30 1 994) by EcdBl + BglB. double digestion and ligated to the 0.75 kb vector containing the 
pUC miniorigin and the Mu ends, producmg the kanMX4-Mu plasmid, pHTHl. Plasmid 
DNA was isolated with the Plasmid Maxi Kit (QIAGEN). To confirm the absence of 
mutations in the kanWC4 module the insert was sequenced following the in vitro 
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transposition reaction with the Cam^ Entranceposon as a donor DNA and the plasmid 
pHTHl as a target DNA with primers Muc 1 and Muc2. 

• The primera for sequencing the yeast constructs were Mucl : 
5'-GCTCTCCCCGTGGAGGTAAT-3' (SEQ ID NO.l) and Muc2: 
5'-TTCCGTCACAGGTATTTATTCGGT-3' (SEQ IDNO:2). 

We also constructed a transposon with a bacterial replicon between the Mu ends to allow 
easier outcloning. The pl5A replicon was cut from the plasmid pACYC184 (Rose 1988) 
with Sphl, blunted with Klenow enzyme, and ligated into £'coRI-cut end-filled pHTHl to 
produce kanMX4-pl5A-Mu plasmid, pHTH4. 

Construction of Mu/LoxP-Kan/Neo transposon 

A neomycin-resistance cassette containing a bacterial promoter, SV40 origin of replication, 
SV40 early promoter, Iranamycin/neomycin resistance gene, and Herpes simplex virus 
thymidine kinase polyadenylation signals was generated by PGR from pIRES2-EGFP 
plasmid (Clontech). After addition of LoxP sites and Mu end sequences using standard 
PCR-based techniques, the construct was cloned as a BglR fragment into a vector 
backbone derived from pUC19. The construct (pALH28) was confirmed by DNA 
sequencing. 

Assembly and concentration of transpososomes 

The transposons (kanMX4-Mu, 1.5 kb; kanMX4-pl5A-Mu, 2.3 kb; Mu/LoxP-Kan/Neo, 
2. 1 kb) were isolated by BglO. digestion from their respective carrier plasmids (pHTHl, 
pHTH4, pALH28). The DNA fragments were purified chromatogr^hically as described 
(Haapa et al. 1999a). 

The standard in vitro transpososome assembly reaction (20 p,l or 80 pi) contained 55 nM 
transposon DNA fragment, 245 nM MuA, 150 mM Tris-HCl pH 6.0, 50 % (v/v) glycerol, 
0.025 % (w/v) Triton X-100, 150 mM NaCl, 0.1 mM EDTA. The reaction was carried out 
at 30*C for 2 h. The complexes were concentrated and desalted from several reactions by 
Centricon concentrator (Amicon) according to manufecturer's instructions and washed 
once with water. The final concentration for the yeast transformations was approximately 
tenfold and for the mouse transfections about 20-fold. 
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Electrocompetent bacterial and yeast cells 

Electroconq)etent bacterial cells for standard cloning were prepared and used as described 
(Lamberg et al., 2002). Electrocompetent S. cerevisiae cells were grown as foUows. An 
overnight stationary phase culture was diluted 1: 10 000 in fresh YPD (1 % yeast extract, 2 
% peptone, 2 % glucose) and grown to A«,o 0.7 - 1.2. The cell pellets were collected by 
centrifugation (5000 rpm). suspended in 14 volume of 0.1 M lithium acetate, 10 mM 
dithiotreitol, 10 mM Tris-HCl pH 7.5, 1 mM EDTA (LiAc/DTT/TE) and incubated at 
room temperature for 1 h. The repelleted cells were washed with ice-cold water and again 
collected by centrifugation. The pellet was tiien resuspended in 1/10 of the original volume 
of ice-cold 1 M sorbitol. FoUowing centrifugation, the peUet was suspended in ice-cold 1 
M sorbitol to yield ~200-fold concentration of the original culture density. One hundred 
microliters of cell suspension were used for each electroporation. For conqietence status 
determionation, transpososomes or plasmid DNA were added to the cell suspension and 
incubated on ice for 5 min. The mixture was transferred to a 0.2 cm cuvette and pulsed at 
1.5 kV (diploid FY 1679) or 2.0 kV (haploid FY-3), 25 yxF, 200 ohms with Bio-Rad 
Genepulser n. After electroporation 1 ml of YPD was added, and cultures were incubated 
at aO'C for 0-4 hours. Subsequently cells were plated on YPD plates containing 200 ng/ml 
of G418. The competent status of the yeast strains was evaluated in parallel by 
electroporation of a control plasmid pYC2/CT (URA3, CEN6/ARSH4, amp^ pUC ori. 
Invitrogen) and plating the cells on minimal plates. 

Mouse ES cell transfection and colony isolation 

The procedures used for electroporation of mouse AB2.2-Prime embryonic stem (ES) ceUs 
have been described by Sands and Hasty (1997). Briefly, the AB2.2-Prime ES ceUs were 
collected m phosphate-buffered saline (PBS) at a density of 1 IxlO*' cells/ml. 2.2-2.3 ng of 
the transposon complexes or linearized DNA was added to an 0.4 cm electroporation 
cuvette. For each electroporation, 0.9 ml of ES cell suspension (approximately 10 x 10** 
cells) was mixed with transpososomes or linear DNA. Electroporation was carried out 
using Bio-Rad's Gene Pulser and Capacitance Extender at 250 V, 500 ^F. After 
electroporation the ceUs stood at RT for 10 min and were then plated in gelatin coated 
plates.The electroporated ES cells were cultured in the conditions mentioned above for 24- 
48 hours before adding G418 (Gibco) to a final concentration of 150 ng/ml to select 
transposon insertions. Transfected colonies of ES cells were picked after 10 days in 
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selection and individual colonies were cultured in separate wells of the 96-wells or 24- 
wells plates using the conditions described above. 

HeLa cell transfection and colony isolation 
5 The HeLa cells were electroporated basically according to the instructions by ATCC. 
Briefly, the HeLa cells were collected m phosphate-buffered saline (PBS) at a density of 
1.8 X 10* cells/ml. 2 - 2.3 ng of the transposon complexes or linearized transposon DNA 
was added to an 0.4 cm electroporation cuvette. For each electroporation, 0.9 ml of HeLa 
cell suspension (approximately 1.6 x 10^ cells) was mixed with transpososomes or linear 

10 DNA. Electroporation was earned out using Bio-Rad's Gene Pulser and Capacitance 

extender at 250 V, SOOjiF. After electroporation the cells stood at RT for 10 min and were 
then plated. The electroporated cells were then cultured in the conditions mentioned above 
for 60 hours before adding G418 (Gibco Invitrogen) to a final concentration of 400 |Jig/ml 
to select transposon msertions. Transfected colonies of HeLa cells were picked after 10-1 1 

15 days in selection and individual colonies were cultured first in separate wells of the 96- 
wells plate, and transferred later to separate wells of 24-wells or 6-wells plates and 10 cm 
plates using the conditions desoibed above. 

Isolation of genomic DNA 
20 Yeast Genomic DNA of each geneticin resistant yeast clone was isolated either with 

QIAGEN Genomic DNA Isolation kit or according to Sherman et al., 1981. 

Mouse ES cells Genomic DNA was isolated firom ES cell essentially according to the 
method developed by Miller et al. (1988). ES cells were collected fi-om individual wells 

25 firom the 24-well cultures and suspended to 500 p-l of the proteinase K digestion buffer (10 
mM Tris-HCl (pH 8.0), 400 mM NaCl, 10 mM EDTA, 0.5 % SDS, and 200 ^ig/ml 
proteinase K). The proteinase K treatment was carried out for 8-16 hours at 55''C. 
Following the proteinase K treatment 1 50 jil of 6 M NaCl was added followed by 
centrifiigation at microcentrifiige (30 min, 13 K). The supernatant was collected and 

30 precipitated with ethanol to yield DNA pellet that was washed with 70% ethanol and air- 
dried. DNA was dissolved m TE (10 mM Tris-HCl, pH 8.0 and 1 mM EDTA) buffer. 



HeLa cells Genomic DNA was isolated firom HeLa cells essentially according to the 
method developed by Miller et al. (1988). HeLa cells were collected &om three 10 ci 
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plates and suspended to 15 ml of proteinase K digestion buffer (10 mM Tris-HCl (pH 8.0), 
400 mM NaCl. 10 mM EDTA, 0.5% SDS. and 200-400 ^g/ml proteinase K). The 
proteinase K treatment was carried out at 55'»C for 16-48 hours or until no ceUs were 
visible. RNase was added at 25-50 ^ig/ml and incubated at 37°C for 8-24 hours. Following 
5 the RNase treatment 4.5 ml of 6 M NaCl was added followed by centrifugation (SS-34, 
1 1.6-14 BC, 20-30 min, 4°C). The supernatant was collected and precipitated with ethanol to 
yield DNA pellet that was washed with 70% ethanol and air-dried. DNA was dissolved in 
TE (10 mM Tris-HCl (pH 8.0) and 1 mM EDTA) buffer. 

10 Southern blot 

Yeast The DNA was digested with appropriate enzymes. The fragments were 
electrophoresed on a 0.8 % agarose gel and blotted onto Hybond N+ membrane 
(Amersham). Southern hybridisation was carried out with [o^^pj^jCTP -labelled (Random 
Primed, Roche) kanMX4 (Bgm-EcoBl fragment) as a probe. 

15 

Mouse ES cells DNA Southern blot hybridisation was performed according to standard 
methods as described (Sambrook, et al., 1989). 10-15 ^g of the wild type and transfected 
AB2.2-Prime ES cell DNAs were digested with various restriction enzymes and separated 
on 0.8% agarose gels. The DNA was transferred to a nylon filter (Hybond N+, Amersham) 

20 and fixed with UV (Stratalinker, Statagene). Inserted DNA was visualized by hybridisation 
with a [a-^^p] dCTP-labeled (RediprimeH, Amersham) DNA probes (Mu/LoxP-Kan/Neo 
BamHl fragment). Hybridisation was performed at 65**C for 16 hours in solutions 
contaming 1.5 x SSPE, 10% PEG 6000, 7% SDS, 100 Mg/ml denatured herring sperm 
DNA. After the hybridisation, the filter was washed twice 5 min and once 15 min in 

25 2xSSC, 0.5% SDS at 65''C and once or twice for 10-15 min in the O.lxSSC, 0. 1%SDS at 
65°C. The filter was exposed to a Fuji phosphoimager screen for 8-16 hours and processed 
in a FujiBAS phosphoimager. 

HeLa cells Southern blot hybridisation was performed according to standard methods as 
30 described (Sambrook et al., 1989). 10 pg of the wild type and transfected HeLa ceU DNAs 
were digested with Nhel + Spel and separated on 0.8% agarose gel. The DNA was 
transferred to a nylon filter (Hybond N+, Amersham) and fixed with UV (Stratalinker, 
Stratagene). Inserted transposon DNA was visualized by hybridisation with a [a-^^P] 
dCTP-labeled (RediprimeH, Amersham) DNA probe (Mu/LoxP-Kan/Neo transposon). 
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Hybridisation was performed at SS'^C for 16 hours in solutions containing 1.5 x SSPE, 10% 
PEG 6000, 7% SDS. 100 ^g/ml denatured herring spenn DNA. After the hybridisation, the 
filter was washed three times for 20-40 min in 2 x SSC, 0.5% SDS at 65»C and three times 
for 20-40 min in 0.1 X SSc, 0.1% SDS at 65«»C. The filter was exposed to a Fuji 
phosphoimager screen for 8-16 hours and processed in a FujiBAS phosphoimager 

Determination of tai^et site duplication 

Cloning. Yeast genomic DNA was digested with Bamm + Bgia, SalLi- Xhol or PyuTL to 
produce a Augment with a transposon attached to its chromosomal DNA flanks. These 
firagments were then cloned into pUC19 cleaved with Bamm, Sail or Smal, respectively, 
selecting for kanamycin and ampicillin resistance. Alternatively, clones transfected with 
kanMX4-pl5A were cleaved with BamW. + BglTL, Hgated, electroporated and selected for 
resistance produced by the transposon containing firagments. DNA sequences of transposon 
borders were determined from these plasmids using transposon specific primers SeqA and 
SeqMX. Genomic locations were identified using the BLAST search at SGD 
(Saccharomyces Genome Database; ht^://genome-www.stanford.edu/SaccharomycesO or 
SDSC Biology WorkBench (htlp://woricbench.sdsc.edu/) servers. 

The primers for sequencing tiie ends of cloned yeast inserts were Seq A: 
5*-ATCAGCQGCCGCGATCC-3' (SEQ ID NO:3) and Seq MX4: 
5'-GGACGAGGCAAGCTAAACAG-3' (SEQ ID NO:4). 

PGR amplification. Two micrograms of yeast genomic DNA was digested with Bamm 
+Bgm ovNhel + Spel. Specific partially double-stranded adapters were made by annealing 
2 mM adapter primer 1 (WAP-1) with complementaiy 2 adapter primer 2 (WAP-2*), 3 
(WAP-3*), or 4 (WAP-4*). The 3' OH group of tiie WAP.2*, WAP-3*, and WAP-4* 
primers was blocked by a primary amine group and the 5' ends were phosphorylated. The 
restriction Augments (200 ng) g«ierated by Bamm + Bgm. were Ugated witii 22 ng of 
adapter tiiat was made by annealing primers WAP-1 and WAP-2*, whereas the restriction 
fi:agments generated with Nhel + Spel were Ugated witii tiie 22 ng of adapter made by 
annealing primers WAP-1 and WAP-3* One fifth of tiie ligation reaction was used as a 
template to perform PGR ampUfication at 20 ^1 to enrich for DNA fragments between the 
adapter and tiie transposon with primers Walker-l and TEFterm-1 or Walker-1 and 
TEFprom-1. PGR conditions were 94''C, 1 min, 55 °G, 1 min. 72 °C, 4 min for 30 cycles. 
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Nested PGR was carried out at 50 jil using 2 \)1 of one hundred-fold diluted primary PC3R. 
products as a template using primers WaIker-2 and TEFterm-2 or Walker-2 and TEFprom- 
2 for PGR products produced fiom BamHl + BglR fragments and Walker-3 and TEFteim-2 
or Walker-3 and TEFprom-2 for PGR products produced from the Nhel + Spel fragments. 
5 The PGR conditions were as before. The amplified nested PGR products were sequenced 
using sequencing primer Mu-2. 



One microgram of mouse genomic DNA was digested with BglH + Bell or Nhel + Spel. 
Specific partially double-stranded adapters were made as for the yeast. The restriction 

1 0 firagments (400 ng) generated by Bcil + BglD. were ligated with 44 ng of adapter that was 
made by annealing primers WAP-1 and WAP-2*, whereas the restriction fiagments (200 
ng) generated with Nhel + Spel were ligated with the 22 ng of adapts made by annealing 
primers WAP-1 and WAP-3*. Respectively, one fourth or one fifth of Ihe ligation reaction 
was used as a tenqjlate to perform PGR amplification at 20 ^1 to enrich for DNA Augments 

1 5 between the adapter and the transposon with primers Walker- 1 and HSP43 0 or Walker- 1 
and HSP431. PGR conditions were 94*0, 1 min, 55 "C, 1 min, 72 "C, 4 min for 30 cycles. 
Nested PGR was carried out at 50 ^1 using 2 jul of eighty fold or one hundred-fold diluted 
primary PGR products as a template using primers Walker-2 and HSP429 or Walker-2 and 
HSP432 for PGR products produced from BcH + Bgia fragments and Walker-3 and 

20 HSP429 or Walker-3 and HSP432 for PGR products produced from the Nhel + Spel 

fragments. The PGR conditions were as before. The amplified nested PGR products were 
sequenced using sequencing primer Mu-2. 

Primers for PGR-based detection: 

25 WAP-1 GTAATAGGAGTGAGATAGGGGGGGCGCGGGGGG(SEQIDNO:5) 
WAP-2* GATGGGGGGGGCG-NH2 (SEQ ID NO:6) 



WAP-3* GTAGGGGGGGGGG-NH2 (SEQ ID NO:7) 



30 WAP-4* AATTGGGGGGGGG-NH2 (SEQ ID NO:8) 



Walker-1 GTAATAGGAGTGAGATAGGG (SEQ ID NO:9) 
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Walker-2 GGGCGGCCGCCCGGGCGATC (SEQ ID NO: 10) 
Walker-3 GGGCGGCCGCCCGGGCCTAG (SEQ ID NO: 1 1) 

Walker-4 GGGCGGCCGCCCGGGCAATT (SEQ ID NO: 12) 
TEFtenn-1 CTGTCGATTCGATACTAACG (SEQ ID NO: 13) 
TEFterm-2 CTCTAGATGATCAGCGGCCGCGATCCG (SEQ ID NO: 14) 

TEFprom-1 TGTCAAGGAGGGTATTCTGG (SEQ ID N0:15) 

TEFpiDm-2 GGTGACCCGGCGGGGACGAGGC (SEQ ID NO: 16) 
Mu-2 GATCCGTTTTCGCATTTATCGTG (SEQ ID NO: 17) 



HSP429 GGCCGCATCGATAAGCTTGGGCTGCAGG (SEQ ID NO: 1 8) 

HSP430 ACATTGGGTGGAAACATTCC (SEQ ID NO: 19) 

HSP43 1 CCAAGTTCGGGTGAAGGC (SEQ ID NO:20) 

HSP432 CCCCGGGCGAGTCTAGGGCCGC (SEQ ID NO:21) 

HeLa cells The genomic HeLa cell DNA was digested with BamHI + BcU to produce a 
fiagment with a transposon attached to its chromosomal DNA flanks. These fragments 
were then cloned into pUC19 cleaved with BamHI, selecting for kanamycin and ampicillin 
resistance. DNA sequences of transposon borders were determined from these plasmids 
usmg transposon specific primers HSP430 and HSP431. Genomic locations were identified 
usmg the SSAHA search at Ensembl Human Genome Browser Release 20.34c. 1 which is 
based on the NCBI 34 assembly of the human genome. 

RESULTS 



Transposon construction and its mtroduction to the cells 
To study if the Mu transposition system works also for eukaryotes (Figure 1) we 
constructed a kanMX4-Mu transposon containing the kan'^ gene from TnPOJ and 
translational control sequences of the TEF gene of Ashbya gossypii between the Mu ends, 
with or without additional bacterial pl5A replicon between the Mu ends (Figure 2A). We 
studied the assembly of Mu transpososomes by incubating MuA protein with the kanMX4- 
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Mu transposon and detected stable protein-DNA complexes by agarose gel electrophoresis 
(Figure 3). The reactions with kanMX4-Mu and kanMX-pl5A-Mu transposons produced 
several bands of protein-DNA complexes that disappeared when the sample was loaded in 
the presence of SDS indicating that only non-covalent protein-DNA interactions were 
5 involved in the complexes. An aliquot of assembly reactions with and without MuA 

transposase were electroporated into Saccharomyces cerevisiae cells and the yeasts were 
scored for geneticin resistance. The competent status of the yeast strains was evaluated in 
parallel by electroporaticn of a control plasmid pYC2/CT. The electroporation efficiency 
with the transpososomes into the yeast was approximately three orders of magnitude lower 
10 than the efficiency with the plasmid (Table 1). This result is consistent with previous 

results with bacteria (Lamberg et al 2002). Only the sample containing detectable protein- 
DNA complexes yielded geneticin resistant colonies. 

For mouse experiments we constracted a Mu/loxP-Kan/Neo transposon that contained 
15 bacterial and eukaiyotic promoters, kanamycin/neomycin resistance gene, and Herpes 
simplex virus thymidine kinase polyadenylation signals (Figure 2B). The transfection of 
the mouse ES cells with the transpososome resulted in 1720 G418 resistant colonies per |ig 
DNA and the linear control in 330 resistant colonies per jig DNA. Thus the transfection 
with the transpososome yielded over 5 times more resistant colonies per [xg DNA. The 
20 control cells with no added DNA did not produce any resistant colonies. 

In HeLa cells, transfection with the transpososomes resulted in about 10^ resistant colonies 
per |j.g DNA and transfection with the linear control DNA resulted m about 10^ resistant 
colonies per ^ig DNA. Thus the transpososomes were significantly more efficient in 
25 generating transfectants. The control cells with no added transposon did not produce any 
resistant colonies. 

Integration of the transposon into the genome 

Southern blot analysis can be used to study whether the transposon DNA was inserted into 
30 the genomic DNA of the geneticin-resistant colonies. Digestion of genomic DNA with 
enzyme(s) which do not cut the transposon produces one firagment hybridising to the 
transposon probe, and digestion with an enzyme which cuts the transposon once produces 
two firagments in the case of genuine Mu transposition. Genomic DNA from 17 kanMX4- 
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Mu transposon integration yeast clones was isolated, digested with BamHI + Bgia that do 
not cut the transposon sequence, or with Hindm that cleaves the transposon sequence once 
and analysed by Southern hybridisation with kanMX4 fragment as the probe. Fifteen 
isolates generated a single band with a discrete but different gel mobility after BamHI + 
Bgm digestion (Figure 4A) and two bands after Hintm digestion (Figure 4B). Control 
DNA from the recipient strain FY1679 did not generate detectable bands in the analyses. 
Two isolates (G5 and G14) gave several hybridising fragments after Bamm + BgUl 
digestion suggesting possibiUty of multiple transposon integrations. However, these two 
isolates gave three fragments after Hindm digestion, instead of doubling the amount of 
fragments detected in the BamHI + BglJI digestion expected in case of multiple transposon 
integrations. The sizes of the Hindm fragments of the isolates G5 and G14 (4.3, 2.4 and 
1 .3 kb) and the pattern of bands in BamHI + BglU digestion suggested that the transposon 
was integrated into the yeast 2n plasmid (for confirmation of this see sequencing results 
below). Genomic DNA from 17 G418-resistant isolates of the haploid strain FY-3 was 
analysed in a similar way after ;5^oI + Sail digestion (which do not cut the frausposon) and 
Pstl digestion (one cut in the transposon). Thirteen isolates gave one band after JiTioI + Sail 
digestion and two bands after Pstl digestion suggesting a single integration. Four isolates 
gave similar pattern of bands as isolates G5 and G14 of strain FY1679 suggesting 
integration into the 2^ plasmid (results not shown). These data indicate that in most of the 
studied clones the transposon DNA was integrated as a single copy into the yeast 
chromosome. In the rest of the clones a single integration was detected in an episome. 

Seven mouse ES cell clones were analysed by Southern blotting. Their chromosomal DNA 
was digested with BamHI which releases ahnost an entire transposon from the genome. All 
the clones studied had a band at the same position as the BamHI digested p ALH28 used as 
a control. The mtensity of the band was similar for all clones studied and for control DNA 
representing same molar amount of DNA as the genomic samples. This suggests that only 
one copy of the transposon was integrated into each genome. 

In HeLa cells. Southern blot analysis was used to confirm that the G418 resistant colonies 
had the transposon integrated into their genomes. Digestion of the genomic DNA with 
restriction enzyme(s) that do not cut the transposon produces one fragment hybridising to 
the transposon probe. Seven HeLa cell tiansfectant clones were analysed by Southern blot 
as shown in Figure 6. Their chromosomal DNA was digested withiV7je/+ S^el wMch do 
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not cut the transposon. A single band was detected fiom each of the clones indicating that a 
single copy of the transposon DNA has been integrated in each of the genomes. 

The location of insertions in the chromosomes 

Yeast Mu transposons integrate ahnost randomly into the target DNA (Haapa-Paananen et 
al., 2002). To test the location and distribution of the transposon insertions we cloned 
transposon-genomic DNA borders from more than one hundred yeast transformants and 
sequenced the insertion sites on both sides of the transposon using transposon-specific 
primers (Seq A + Seq MX4). Exact mapping of the insertion sites was possible by BLAST 
comparison with the SGD database. We used the strain FY1679 which was used in the 
yeast whole genome sequencing (Winston et al. 1995) to ensure the correct mapping. The 
overall distribution of 140 integrations on the 16 chromosomes of the yeast is shown in 
Figure 5 A. All chromosomes were hit at least once. Both ORFs and intergenic regions had 
transposon integrations (Table 2). List of integrations into the genome is presented in Table 
3. In the haploid genome, integrations on the essential genes were naturally missed due to 
the inviability of the cells. On chromosome Xn there seems to be a real "hotspot" for 
transposon integration but this is an artefact since the "hotspot" is in the approximately 9 
kb region encoding ribosomal RNA (Figure 5B). This loci is repeated tandemly 100-200 
times in the chromosome XU. In this region, the integrations are distributed randomly. The 
chromosomes in Figure 5A are drawn according to SGD which shows only two copies of 
this repeated region (when the systematic sequencing of the yeast genome was done, only 
two rDNA repeats were sequenced) instead of 100 to 200 copies actually present in a yeast 
genome consisting of 1 to 2 Mb of DNA. Only nine integrations were found at a distance 
less than 1 kb from a tRNA gene which shows that Mu-transposon integration differs from 
that of Ty 1-Ty4 elements. Integration closest to the end of a chromosome was 6.3 kb 
showing the difference to the telomere-preferring Ty5 element. The mean interval distance 
of insertions was 135 kb and was nowhere near covering the whole genome as a library. 
However, the distribution was evrai enough to show tiie randomness of the integration. 

Mouse The sequenced transposon-genomic DNA borders were compared to the Mouse 
Genome Assembly v 3 using Ensembl Mouse Genome Server. The clone RGC57 
contained an integrated transposon in the chiomosome 3, duplicating positions 59433906- 
10, which are located in the last intron of both the ENSMUSESTG00000010433 and 
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10426. Sequencing showed presence of tbis 5-bp sequence (target site dupUcation) on both 
sides of the integrated transposon. 

HeLa cells We cloned transposon-genomic DNA borders from three transfectants and 
sequenced the insertion sites on both sides of the transposon using transposon-specific 
primers (HSP 430 and HSP431). The integrations are presented in Table 5. All of these 3 
transfectants had intact transposon ends with the 5 bp duplication of the target site at both 
sides of the transposon. 

Integration of the transposon in the yeast 2n plasmid 

Most S. cerevisiae strains contain an endogenous 2)i plasmid. The yeast 2n plasmid is a 
63 18 bp circular species present extrachromosomally in 51 cerevisiae at 60-100 copies per 
cell. The plasmid molecules are resident in the nucleus as minichromosomes with standard 
nucleosome phasing (Livingston and Hahne 1979; Nelson and Fangman 1979; Taketo et 
al., 1980), 

In 23 clones out of 131 clones (17.6 %) the transposon had integrated in the 2\x plasmid 
and in 108 clones (82.4 %) the transposon had integrated into the chromosomes in the 
diploid strain FY1679. In the haploid strain FY-3, four clones out of 49 clones (8.2 %) had 
the transposon in the 2^ plasmid and 45 clones (91.8 %) had the transposon in the 
chromosomes. 

Transposon target site 

Genuine Mu transposition produces a 5-bp target site duplication flanking the integrated 
transposon (Haapa et al. 1999b). The transposon was flanked by target site duplication in 
121 clones (out of 122; 99.2 %) of the strain FY1679 and in 42 clones (out of 46; 91.3 %) 
in the haploid strain FY-3 confirming that a majority of integrations were generated by 
DNA transposition chemistry. A consensus sequence of 5 bp duplication (5'-N-Y-G/C-R- 
N-3') has been observed in both in vivo and in vitro transposition reactions, the most 
preferred pentamers being 5'-C-Y-G/C-R-G- 3' (Mizuuchi and Mizuuchi 1993; Haapa- 
Paananen et at. 2002; Butterfield et al. 2002). In this study, the distribution of nucleotides 
in duplicated pentamers agreed well with the previous results (Table 4). 
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Table 1 . Number of geneticin-resistant colonies detected following electroporation of 
transpososomes into yeast strains, cfu/ pig DNA 

DNA FY1679 FY-3 

KanMX-Mu +MuA 351 178 

KanMX-Mu - MuA 0 1 

KanMX-pl5A-Mu + MuA 43 61 

KanMX-pl5A-Mu - MuA 0 0 

PlasmidpYC2/Cr 6.9x10^ 5.6x10^ 

" Electroporation of plasmid pYC2/CT DNA served as a control for competent status. 



Table 2. Distribution of transposon integrations in FY1679 (diploid) and 
FY-3 (haploid) strains. 



Integration site FY1679 FY-3 Total 

Chromosomal DNA 



Protein coding region 






53 


Essential gene 


12 (1 intron) 


0 




Nonessential gene 


29 


11 




rRNA 


12 


7 


19 


tKNA (intron) 


1 


0 


1 


Ty 


2 


0 


2 


Solo-LTR 


1 


2 


3 


Intergenic region 


48 


23 


71 


2p. plasmid 








Protein coding region 


4 


2 


6 


Intergenic region 


12 


2 


14 




121 


47 


169 
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Table 3A. Transposon integration sites and target 
Saccharomyces cereviaiae diploid strain FY1679. 
*-sec[mx4 seqA-> 



Gl 
G2 
G3 
G4 
G5 
G6 
G7 
G8 
G9 
GIO 
Gil 
G12 
G13 
G14 
G15 
G16 
G17 
G18 
G19 
G20 
G21 
G22 
G23 
G34 
G36 
G37 
G38 
G40 
G41 
G42 
G43 
G44 
G45 
G46 
G47 
G48 
G49 
G50 
G51 
G52 
G53 
G54 
G55 
G56 
G57 
G58 
G59 
G60 
G63 
G64 
G65 
G66 
G67 
G67 
G70 
G71 
G74 
G75 



caacatctagercAG (KanMX4-Mu) CTCAGtgagttccga 
agtac taccaTTGAA {KanMX4 -Mu) TTGAAtttacgttca 
taaaaattcaGGCAT (KanMX4-Mu) GGCATatacaattat 
taaaccaccaTCTGT (KanMX4-Mu) TCTGTcgcccatctt 
ctgattactaGCGAA (KanMX4 -Mu) GCGAAgctgcgggtg 
aagaaaagctCAGTG (KanMX4-Mu) CAGTGgaataatttt 
gaactctttcCCCAC (KanMX4-Mu) CCCACcgatccattg 
aaagatgaaaCCGAG (KanMX4-Mu) CCGAGtaagctgcta 
caatgcatcaTCTAC (KanMX4-"Mu) TCTACattacaaacc 
tttgttcacgCGGGC (KanMX4-Mu) CGGGCcgcagttgtg 
atctgtat taACTTC {KanMX4 -Mu) ACTTCgaggtagtaa 
ttttcatgttCCTAT (KanMX4-Mu) CCTATtcttgttctt 
tatccacttcTTAGA (KanMX4-Mu) TTAGAgggactatcg 
aaactgttttACAGA (KanMX4 -Mu) ACAGAtt tacgatcg 
tggagttaggCTGGC (KanMX4-Mu) CTGGCtcggactggc 
gagcttctgcTTCAC (KanMX4-Mu) TTCACgttttttgga 
taacgctagaGGGGC (KanMX4-Mu) GGGGCaagaaggaag 
tccaaccgtaGTGGT (KanMX4-Mu) GTGGTtatataataa 
gggggcaatgGTGAA{KanMX4-Mu) GTGAAatttcgacgc 
taagagcttgTCCGC (KanMX4-Mu) TCCGCttcgccccaa 
cataagtgtaAGCCA (KanMX4 -Mu) AGCCAtatgttccct 
tctggcttaaACCAG (KanMX4-Mu) ACCAGcactatgtat 
gttgaatcttCCGAT (KanMX4-Mu) CCGATaccatcgaca 
ccctagcgccTAGGG (KanMX4-Mu) TAGGGtcgagtactg 
ttgctttaacTAGGA (KanMX4 -Mu) TAGGAaagaataaga 
agagactgaaGACGA (KanMX4-Mu) GACGAggaaatcaaa 
atggatggcgCTCAA (KanMX4-Mu) CTCAAgcgtgttacc 
tccatcttctGTGGA{KanMX4-Mu) GTGGAgaagactcga 
ttcactcattCTGGT (KanMX4-Mu) CTGGTcatttcttcg 
ctagcgctttACGGA (KanMX4 -Mu) ACGGAagacaatgta 
ggtaataggcCCGTG (KanMX4-Mu) CCGTGcggttccgtc 
gtggtgccctTCCGT (KanMX4-Mu) TCCGTcaattccttt 
ttcgctgctcACCAA (KanMX4 -Mu) ACCAAtggaatcgca 
aatattatctTCTGT (KanMX4-Mu) TCTGTcattgttact 
gtatgtacccACCGA (KanMX4 -Mu) ACCGAtgtagcagta 
gttgatggtaCCTTG (KanMX4-Mu) CCTTGacaccagcca 
tacattgtctTCCGT (KanMX4 -Mu) TCCGTaaagcgctag 
ccgtggaagcCTCGC (KanMX4-Mu) CTCGCccgatgagtt 
tttcttttccTCCGC (KanMX4-Mu) TCCGCttattgatat 
gctgcgtctgACCAA(KanMX4-Mu)ACCAAggccctcact 
tactgttgaaCCGGG (KanMX4 -Mu) CCGGGtcgtacaact 

caaatgtatcAGCAG(KanMX4-Mu)AGCAGatgtacttcc 
agtttccgctATAAA (KanMX4-Mu) ATAAAtaatggcagc 
aaaggaattgCTAGG (KanMX4-Mu) CTAGGggcattactc 
aaaaataattACTCT (KanMX4 -Mu) ACTCTaacatttctt 
tgtttatatgATGAC(KanMX4-Mu)ATGACgattttccca 
ttgtgtatttTTGAT (KanMX4-Mu) TTGATtgaaaatgat 

tatgataatcAAGGC(KanMX4-Mu)AAGGCataattgact 
cagcattaaaACGGC (KanMX4 -Mu) ACGGCagcaaagccc 
ttgacatgtgATCTG {KanMX4-Mu) ATCGTcacagatttt 
tcagctctcaGCAGA (KanMX4-Mu) GCAGAgaaaaaattt 
tgctaggtgtGTCTG (KanMX4-Mu) GTCTGtttatgcatt 
caattgaggtTTGAA (KanMX4 -Mu) TTGAAattgctggcc 
aatcatgcatTGCAT (KanMX4-Mu) TGCATaatgtggtat 
acgatcttacGTCGG (KanMX4-.Mu) GTCGGctatctcacc 
ttgtatttaaACTGG (KanMX4-Mu) ACTGGagtgatttat 
tgcatatttgCCTGC (KanMX4-Mu) CCTGCgaaaaaaagt 
tcgttgaataATGGA ( KanMX4 -Mu) ATGGAaaatatgaaa 



site duplications in 

liocation* 



chrl3 : 908424-908428 
chr9 : 279340-279344 
Chrl6 : 569334-568338 
Chrl2: 239388-239392 
2vi : 3447-3451 {NC_001398) 
chr4 : 825525-825529 
chrl6 : 862127-862131 
Chr3 ; 263950-263954 
Chr2 : 766314-766318 
chrll: 308515-308519 
Chr7: 854983-854987 
Chr5 : 327111-327115 
Chrl2 : 456350-456354 
2]x: 2720-2724 
chrlO : 702930-702934 
Chr7 : 568606-568610 
Chrl: 136875-136879 
chr 10: 241383-241387 
chr4: 276367-276371 
Chrl3 : 904363-904367 
chr9 : 249583-249587 
chr4 : 544898-544902 
chrl2 : 65144-65148 
chr9: 138283-138287 
Chrl5 : 892270-892274 
chrl6 : 67656-69660 
chrl2 :453865-453869 
chrl4 : 661338-661342 
chrl5 : 720163-720167 
2]i;2838-2842 
Chris : 836789-836793 
chrl2 :456583-456587 
Chrl2 :458164-458168 
chrlO : 135624-135628 
chrl5 : 829039-829043 
Chr6 : 44321-44325 
211:2838-2842 
chrlO : 526881-526885 
chrl2 :455126-455130 
chrl2 :453213-453217 
chrl4 : 736161-736165 
chrl4: 566860-566864 
chrlO : 161496-161500 
chrl2 : 912615-912619 
chrl6: 120160-120164 
chrll :306835-306839 
Chr4 : 600461-600465 
Chr2 :429112-429116 
chrl6: 826635-82 6639 
2vi:5268-5272 
chr2 : 117272-117276 
chrl4:331432-331436 
chr. 12 :455361-455365 
211:2196-2200 
chr3 : 77666-77670 
2vi A:5800-5804 
chr5 :436799-436803 
chrlO : 187594-187598 
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Table 3A (Continued) 



G76 
Gil 
Gia 
G79 
G80 
G81 
G83 
G84 
G85 
G87 
G88 
G90 
G94 
G96 
G97 
G98 
G99 
GlOO 
GlOl 
G102 
G103 
G104 
G105 
G106 
G107 
G108 
G109 
GllO 
Gill 
G113 
G114 
G116 
G117 
G118 
G120 
G121 
G122 
G123 
G124 
G125 
G126 
G127 
G128 
G129 
G130 
G131 
G132 
G133 
G134 
G135 
G136 
G137 
G138 
G140 
G150 
Tl 
T2 
T3 
T4 



ctttcccagaACCAG (KanMX4 -Mu) ACCAGggaaactgtt 
cctctgcatcCCAAC (KanMX4-Mu) CCAACaccagcgata 
atctgtaaacTCGCT {KanMX4-Mu) TCGCTtgtgacgatg 
tcctgcctaaACAGG (KanMX4 -Mu) ACAGGaagacaaagc 
tagaaaaaacCACAA(KanMX4-Mu) CACAAcaacactatg 
t tttggctcgrCCGG {KanMX4 -Mu) TCCGGatgatgcgaa 
tgtggctaccGCCCG (KanMX4-Mu) GCCCGtgattcgggc 
ggcatagtgcGTGTT (KanMX4-Mu) GTGTTtatgcttaaa 
aaaatgcaacGCGAG (KanMX4-Mu) GCGAGagcgctaatt 
gaacagttccACGCC (KanMX4-Mu) ACXSCCtgatatgagg 
agcgcgactgCCCGA (KanMX4 -Mu) CCCGAagaaggacgc 
aaaaggttcaGTAGA(KanMX4-Mu)GTAGAaacataaaat 
ccacaaggacGCCTT{KanMX4-Mu)GCCTTattcgtatcc 
cagaatccatGCTAG (KanMX4-Mu) GCTAGaacgcggtga 
cagctgctacCCAGG (KanMX4-Mu) CCAGGgattgccacg 
ctagccgttcATCAA{KanMX4-Mu)ATCAAtcatgtcaaa 
caaaaaagtcTAGAG (KanMX4-Mu) TAGAGgaaaaaaacg 
t tgtcaaagt ACCGA ( KanMX4 -Mu) ACCGAtcatgacaat 
gtaacatcttGGGCG (KanMX4-Mu) GGGCGtttgcaacac 
actgcctttgCTGAG(KanMX4-Mu) CTGAGctggatcaat 
aatgtaaaagGCAAG (KanMX4-Mu) GCAAGaaaacatgta 
gcctgaattgTAGAT (KanMX4-Mu) TAGATattagataag 
gtttgacattGTGAA (KanMX4 -Mu) GTGAAgagacataga 
tgtcatctacATCAT (KanMX4 -Mu) ATCATcggtat tatt 
cttgttcctaGTGGC (KanMX4.Mu) GTGGCgctaatggga 
agggccctcaGTGAT (KanMX4-Mu) GTGATggtgttttgt 
ggtattttcaTTGGT (KanMX4-Mu) TTGGTtgtaaaatcg 
caatctaaccACCAT (KanMX4 -Mu) ACCATgttggctcac 
cgaaaaatgcACCGG (KanMX4-Mu) ACCGGccgcgcatta 
ttacgatctgCTGAG (KanMX4-Mu) CTGAGattaagcctt 
aaatcgagcaATCAC {KanMX4-Mu) GTGATtgctcgattt 
ccgacaaaccCCCCC (KanMX4-Mu) CCCCCcatttatata 
caataagatgTGGGG (KanMX4-Mu) TGGGGattagtttcg 
gtttaacgctTCCTG (KanMX4 -Mu) TCCTGggaactgcag 
atgaatactcCTCCC (KanMX4-.Mu) CTCCCttgctgttgg 
aatcacaatgGCGGC (KanMX4-Mu) GCGGCcatcgaccct 
gagcaccacgATCGT (KaiiMX4-Mu) ATCGT t egg tg tact 
aaaagcattcTGCAG (KanMX4-Mu) TGCAGtaattagccg 
gtgattctccATGGG (KanMX4 -Mu) ATGGGtggtttcgct 
gctggtccagACCAC {KanMX4 -Mu) ACCACaaaaggatgc 
acttcgacttCGGGT (KanMX4-Mu) CGGGTaaaatactct 
tgacattaatCCTAC (KanMX4-Mu) CCTACgtgacttaca 
tttatatccgGTGGT (KanMX4-Mu) GTGGTtgcgataagg 
ctgatgtgcgGTGGT (KanMX4-Mu) GTGGGccttggactt 
gttgaactacTACGG (KanMX4-Mu) TACGGttaagggtgc 
cctatactctACCGT { KanMX4 -Mu) ACCGTcagggttgat 
aactagcaaaATGGA (KanMX4 -Mu) ATGGAaacaaaaaaa 
ttgactcaacACGGG (KanMX4 -Mu) ACGGGgaaactcacc 
cattgtgaccCTGGC (KanMX4-Mu) CTGGCaaatttgcaa 
atacagctcaCTGTT {KanMX4-Mu) CTGTTcacgtcgcac 
tcagatttttCCCAG (KanMX4-Mu) CCCAGtatggctttg 
tttaacgtggGCGAA (KanMX4 -Mu) GCGAAgaagaaggaa 
ccattccataTCTGT {KanMX4-Mu) TCTGTtaagtataca 
ctttgtgcgcTCTAT{KanMX4-Mu)TCTATaatgcagtct 
aattggtacaGTATG (KanMX4-Mu) GTATGctcaaaaata 

ttgtagcttcCACAA (Mu-KanMX4-pl5A.Mu) CACAAgatgttggct 
tcttattctcCTGTT (Mu-KanMX4^pl5A-Mu) CTGTTgccttcgtac 
cggttgtataTGCAT {Mu-KanMX4 -pl5A-Mu) TGCATtgtacgtgcg 
ttttaataagGCAAT (Mu-KanMX4-pl5A-Mu) GCAATaatattaggt 



Chrl4: 537718-537722 
chr4 : 955105-955109 
chr4 : 480341-480435 
chrl4 : 547141-547145 
chrlO: 111531-111535 
Chr . 16 : 641397-641401 
clir4 : 1433822-1433826 
2>i:541-545 
2p:3134-3138 
chrll: 60765-60769 
chr4 : 1056229-1056233 
chrll :430889-430893 
chrl2 : 451993-451997 
Chrl2 :452043-452047 
chr2 :415433-415437 
chr4 :539356-539360 
Chrl3 :406197-406201 
Chr5: 258808-258812 
Chrl6 : 135372-135376 
2p:2524-2528 
Chr4 : 1011940-1011944 
Chris : 770712-770716 
chrl2 :452744-452748 
Chr4 : 1160847-1160851 
chr4 : 464844-464848 
2VL B:4396-4400 
chrl2 1582690-582694 
Chris ; 75760-75764 
2vl: 5427-5431 

chrl2 :451812-451816 
2)1:2126-2130 
Chris :1039713-1039717 
chrl3 : 895900-895904 
chrl6 :30277-30281 
chrl4 : 175588-175592 
chrl2 : 1030933-1030937 
Chrl3: 67812-67816 
Chris : 638922-638926 
chrl4 :333823-333827 
chrl3 :540587-540591 
Chrl2 :328174-328178 
Chr5:291453-291457 
chr5:317469-317473 
chr5: 336404 -336408 
chrl6 :40318-40322 
Chrl2 .-453842-453846 
Chr2 : 692001-692005 
chrl2 :456534-456538 
chrl2 :651930-651934 
2p B:4039-4043 
chr7: 976865-976869 
chrll :327312-327316 
chrl2 : 460247-460251 
211:3318-3322 
Chrl2 :492584-492588 
chrl2 -.645643-645647 
chrS: 7908-7912 
Chr5 : 402750-402754 
chrl0:538071-538075 
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Table 3A (Continued) 



T5 tatcacttacTCX3AA(Mu-KanMX4-pl5A-Mu)TCGAAcgttgacatt 
T6 aaagacatctACCGT (Mu-KanMX4 -pl5A-Mu) ACCGTgaaggtgccQ 
T7 catattactgCCCGC (Mu-KanMX4-pi5A-Mu) CCCGCgtaatccaat 

T8 gtgttagtgaATGCC (Mu-K anMX4 -PlSA^Mu) ATGCCtcaaactct H 

Target sxte duplication is typed in capital letters. — 
^Chromosome and the coordinates of the duplicated sequence. 

Table 3B- Transposon integration sites and target site duplications in 
Saccharomyces cerevlslae haploid strain PY-3. 



chrl2 : 864259-864263 
Ghr7 :999996-1000000 
Chris : 304883-304887 
chrlO; 304087-304091 



Gl 

G2 

G3 

G4 

G5 

G6 

G7 

G8 

G9 

GIO 

Gil 

G12 

G13 

G14 

G15 

G16 

G17 

G28 

Q37 

G38 

G39 

G4 0 

G42 

G43 

G44 

G45 

G46 

G47 

G48 

G49 

G50 

G51 

G52 

G53 

G54 

G55 

G56 

G57 

G58 

G59 

G60 

G61 

G62 

G63 

G64 

G65 



^seqrmx4 



seqA- 



aaagagaaaaATAAG (KanMX4-Mu) ATAAGaaaatcttct 
cctttttttcGTGGG (KanMX4-Mu) GTGGGaaccgcttta 
atccacctttGCTGC (KanMX4-Mu) (GCTGCttttccttaa) 
tacattcctcCTCAT (KanMX4 -Mu) CTCATttgaccgagg 
gatttatcatGCAGT (KanMX4-iyiu) GCAGTaatactaata 
gaattttaaGAGAtc (KanMX4-Mu) GAtcAAgtcttgtga 
gttcgatgctGTGCG (KanMX4-Mu) GTGCGggacttctac 
cttcacggtaACGTA(KanMX4-Mu) ACGTAactgaatgtg 
caaggagcagAGGGC {KanMX4 -Mu) AGGGCacaaaacacc 
tcaataaacaGCCGA (KanMX4 -Mu) GCCGAcatacat ccc 
gcgagatgagGTGAA (KanMX4-Mu) GTGAAaagaaactta 
taaatttcatCCGGA {K:anMX4-Mu) CCGGAagaaaaatga 
agaaaagtacAATTc (KanMX4-Mu) gATcAaggttacggc 
actgtcttttCCGGT (KanMX4-Mu) CCGGTcattccaaca 
atacacgctcATCAG {KanMX4-Mu) ATCAGacaccacaaa 
atagtatttcCTAGT (KaniyiX4-Mu) CTAGTgatctcggcg 
ttcctattctCTAGA (KanMX4-Mu) CTAGAaagtatagga 
ttataaggttGTTTC (KanMX4-Mu) gaGTTTCatatgtgttt 
ttcgagagtgCCATT (KanMX4-Mu) CCATTgtaccagact 
atggatggcgCTCAA (KanMX4-Mu) CTCAAgcgtgttacc 
tccaaatgtaTTGTG (KanMX4-Mu) TTGTGagatgaaaat 
atgattatttCACGG (KanMX4-Mu) CACGGatt teat tag 
atggaaaactAGCGC (KanMX4-Mu) AGCGCataattttgt 
gagaatcttgTCTTG (KanMX4-Mu) TCTTGatgtaacaaa 
tagcaaacgTAAGTCTtc (KanMX4-Mu) gAAGTCTAAaggttg 
ttgccgcgaaGCTAC (KanMX4-Mu) GCTACcatccgctgg 
gtagctctttTCCAT (KaniyiX4-Mu) TCCATggatggacga 
atgttcattcTCTGT (KanMX4-Mu) TCTGTagcagtaaga 
aatcgtaaccATAAA (KanMX4 -Mu) ATAAAtataagttcc 
ccttcctgctGTGGG (KanMX4-Mu) GTGGGcagagagcga 
tcttagggttATTGG {KanMX4 -Mu) ATTGGtagggttttg 
agttaacttcCCCGG (KanNX4-Mu) CCCGGtgttcagtat 
atgtgtcattGAGGG (KanMX4-Mu) GAGGGaaaatgtaat 
ggttaacttgCTCGC (KanMX4-Mu) CTCGCcatatatatc 
caaaaaaagaTGGAG (KanMX4 -Mu) TGGAGtacagtacgc 
gatatttacgCTTAT (KanMX4-Mu) CTTATcaatctctgg 
gccgtggtttCCGGA (KanMX4 -Mu) CCGGAgaaagacgaa 
tttctggaatTAGGG (KanMX4-Mu) TAGGGtgacagaatg 
attactttatTTGGC (KanMX4-Mu) TTGGCtaaagatcct 
cgttatcataTTGAT (KanMX4 -Mu) TTGAtattgcttatt 
ggcaaactatCTCAC (KanMX4-Mu) CTCACcagaggtctg 
ctaatagtgcATGAT (KanMX4 -Mu) ATGATtatatatcaa 
agaaattctcCTTGG (KanMX4-Mu) CTTGGgattagataa 
tcccgcactgGTGAT (KanMX4-Mu) GTGATacctacaccc 
atcattcattGCCGG (KanMX4-Mu) GCCGGaaaaagaaag 
ctcacQCtc tGCGAT (KanMX4-Mu) GCGATtaacagctca 
2t site dunlTo^tHviiTT TZ i-^^-v^^ 7Z • ^ _ n — tt: 



Location* 



chr3 : 38982-38986 
2|i: A:4372-4376 
2vi:5349-5353 
chrl6: 837554-837558 
chr4: 3069-3073 
chrl5 : 144910-144915 
chrl : 191076-191080 
chrl2 :453541-453545 
Chrl2 :454727-424731 
2p:5123-5127 
chr7 : 284048-284052 
chrll:489457-489461 
chr4: 56735-56740 
Chrll : 428648-428652 
chrl2 :453989-453993 
Chris : 989676-989680 
2ij:704-708 
Chris : 854340-854344 
Chr8 :489155-489159 
Chrl2 :453865-453869 
Chris : 834888-834892 
chrl3 : 97657-97661 
chr4 :437081-437085 
Chr7 : 190765-190769 
chrl2 :459205-459213 
chrl2 :452091-452095 
Chrl2 : 645493-645497 
chrlO -.337762-337766 
chr2 :806825-806829 
Chr7 : 739278-739278 
chr9: 382384-382388 
chrl2 : 1025073-1025077 
Chr7 : 798084-79 8088 
Chr2 : 657457-657461 
Chr2 :466108-466112 
chr2 : 80588-80592 
chrl3 : 347229-347233 
Chr4 : 722468-722472 
Chr4 : 600407-600411 
Chris : 696010-696013 
chrlO : 117057-117061 
chr7 : 853604-853608 
chrS : 137549-137553 
chrl2: 213298-213302 
chrl2 :370966-370970 
ChrlO :404834-404838 



Target site duplication is typed in capital letters, 
*Chromosome and the coordinates of the duplicated sequence. 
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Table 4. Nucleotide consensus of the sequenced duplicated pentamers, 
(Percentage) 

FY1679 (n=121): 



Nucleotide 1 


2 


3 


4 


5 


A 34 (28) 
C 31 (26) 
Q 28 (23) 
T 28 f23) 


10(8) 
58 (48) 
11(9) 
42 f35) 


13 (11) 
45(37) 
49 (41) 

14 fl2) 


47 (39) 
8 (7) 
53(44) 
13 (11) 


27 (22) 
27 (22) 
36 (30) 
31 (26) 


Consensus: N 


C/T 


C/G 


A/G 


N 


FY-3 (n=42): 
Nucleotide 1 


2 


3 


4 


5 


A 8 (19) 
C 14 (33) 
G 12 (28) 
T 8 (19) 


3(7) 
15(36) 

3(7) 
21 f50) 


6(14) 
11 (26) 
18 (42) 

7(18^ 


15 (36) 
1(2) 
22 (51) 
4(10) 


8(19) 
7(17) 
15 (35) 
12 (29) 



Consensus: N C/T C/G A/G N 



FY1679 + FY-3 (n=163^: 

Nucleotide 1 2 3 4 5 



A 
C 
G 
T 


42 (26) 
45 (28) 
40 (25) 
36 (22) 


13 (8) 
73 (45) 
14(9) 
63 (39) 


19 (12) 
56 (34) 
67 (41) 
21 (13) 


62 (38) 
9(6) 
75(46) 
17 (10) 


35 (21) 
34 (21) 
51(31) 
43(26) 


Consensus: 


N 
N 


C/T 




C/G 
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Table 5. Transposon integration sites and target site diq>lications in HeLa cells. 

SiSSe I^catinnv- 

RGC16 agSaggaa^CCAG(Mu/LoxP-Kan/Neo)ACCAGgcacatgctg chr8 : 128251032-128251036 

RGC26 ttaaatgaacTTCAG(Mu/LoxP.Kan/Neo)TrCAGgaaaataatg chrl2 : 15381980-15381984 

RGC35 coaatg8gtcACCAGCMu/LoxP-Kan/Neo)ACCAGaactgaacaa chr2 : 180174041-180174045 



Target site duplication is typed in capital letters. 
^Quomosome and the coordinates of the duplicated sequence. 
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